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Description 

BACKGROUND OF INVENTION 
5 1 . Field of Invention 

The present invention relates to the mutated farnesyldiphosphate synthase capable of synthesizing geranylgeran- , 
yidiphosphate and a process for production thereof, as well as genes coding for said mutated enzymes and a process 
for isolation thereof. 

10 

2. Related Art 

In nature there are various isoprenoid chain compounds comprising 5 carbon atom-basic structure, isoprene units, 
and these isoprenoid compounds play important roles for the life of various organisms. It is known that the chain-exten- 

15 sion mechanism is catalyzed by a series of prenyltransferases which catalyze a series of catalytic reactions comprising 
sequential condensation of isopentenyldiphosphate (IPP) having 5 carbon atoms with its isomer dimethylallyWiphos- 
phate (DMAPP). Among the isoprenoid compounds, farnesyldiphosphate (FPP) having 15 carbon atoms is positioned 
at a branching point in a biosynthesis pathway, from which various physiologically important start to geranylgeranyldi- 
phosphate (GGPP) having 20 carbon atoms, to quinones. squalene, to steroids, farnesylated protein, dolichol etc. 

20 Different prenyltransferases synthesize different isoprenoid compounds having different lengths. However, prenyl- 
transferases have a common activity to condense an isoprenoid unit to extend the chain, and in fact, amino acids 
essential for the condensation are being clarified on the basis of homology of amino acid sequences of different prenyl- 
transferases. However, the mechanism which determines the length of the isoprenoid compound have not yet clarified. 
A biosynthesis pathway for geranyldiphosphate (GPP), farnesyldiphosphate (FPP) and geranylgeranyldiphosphate 

25 (GGPP) starting from an isoprenoid unit is shown in Fig. 1. In this biosynthesis pathway, the prenyltransferase which 
synthesizes farnesyldiphosphate Is designated 'larnesyldiphosphate synthase", and the prenyltransferase which syn- 
thesizes geranylgeranyldiphosphate is designated "geranylgeranyldiphosphate synthase". 

Farnesyldiphosphate synthases are known in Bacillus thermophils (J. Biochem. 113. 355 - 363 (1993)), E. COlI (J. Bio- 
chem. 108, 995 - 1000 (1990)). yeast (J.B.C. ggg. 19176 - 19184 (1989)), rats (Mol. Cell. Biol. Z 3138 - 3146 (1987)) 
30 and in humans (J.B.C. 265, 4607 - 4616 (1990)). and their amino acid sequences are also known. 

On the other hand, geranylgeranyldiphosphate synthases are known in Rhodopseudomonas capusulata (J. Bacte- 
rid. 154. 580 - 590 f1983)). Enwnia uredovora (J. Bacteriol. 172, 6704 - 6712 (1990)), Sulfolobus actdocaldafius f J.B.C. 
269. 14792 - 14797 (1994)) etc. 

However, it had not been known that an enzyme having geranylgeranyldiphosphate synthase activity can be 
35 obtained by mutation of farnesyldiphosphate synthase. 

SUMMARY OF INVENTION 

Accordingly, the present invention provides a novel geranylgeranyldiphosphate synthase obtainable by mutating a 
40 farnesyldiphosphate synthase and a process for production thereof, as well as gene system therefor and a process for 
isolation of the gene. 

More specifically, the present invention provides a process for production of a gene coding for geranylgeranyldi- 
phosphate synthase comprising the steps of: 

45 (1) subjecting genes coding for a farnesyldiphosphate synthase to a mutagenesis; 

(2) expressing the genes subjected to the mutagenesis, and 

(3) selecting a gene which provides a geranylgeranyldiphosphate synthase. 

The present invention further provides a gene coding for geranylgeranyldiphosphate synthase, an expression vec- 
50 tor containing said gene, and a host transformed with said vector. 

The present invention also provides a process for production of geranylgeranyldiphosphate synthase comprising 
expressing said gene, and geranylgeranyldiphosphate synthase obtainable by said process. 

From another point of view, the present invention provides a geranylgeranyldiphosphate synthase having an amino 
acid sequence modified from an amino acid sequence of native farnesyldiphosphate synthase wherein the modification 
55 is deletion of one or more amino acids, addition of one or more amino acids, and/or replacement of one or more amino 
acids with other amino acids. 

The present invention still further provides a gene coding for the above-mentioned geranylgeranyldiphosphate syn- 
thase, a vector, especially an expression vector comprising said gene, and a host transformed with said vector. 
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The present invention further provides a process tor production of geranylgeranyWiphosphate synthase comprising 
the steps of cultivation said host, and purification the geranylgeranyldiphosphate synthase from the culture. 

The present invention further provides a process for production of geranylgeranyldiphosphate or geranylgeranyol. 
comprising the steps of acting the present geranylgeranyldiphosphate synthase on isopentenyldiphosphate. dimethyl- 
allyldiphosphate, geranyldiphosphate or farnesyldtphosphate as a substrate. 

BRIEF EXPLANATION OF DRAWINGS 

Figure 1 represents a biosynthesis pathway for farnesyldiphosphate and geranylgeranyldiphosphate. 

Fig. 2 shows the homology of amino acid sequences of farnesyldiphosphate synthase derived from different spe- 
cies. In this Figure, the sequences in the boxes A to E show regions having relatively high homology and which are 
expected to participate in enzyme activity. 

Fig. 3 shows the homology of amino acid sequences of farnesyldiphosphate synthase derived from different spe- 
cies. In this Figure, the sequences in the boxes F and G show regions having relatively high homology and which are 
expected to participate in enzyme activity 

Fig. 4 shows a native amino acid sequence of farnesyldiphosphate synthase derived from Bacillus stearother- 
mophilus (indicated as W.T), and the mutated points in amino acid sequences of the modified enzymes having geran- 
ylgeranyldiphosphate synthase activity (No. 1 to No. 4). 

Fig. 5 schematically shows a process for construction of the present modified gene. 

Fig. 6 is a profile of reversed phase TLC (developer: acetone/Water = 9/1) showing products formed by acting the 
present enzyme on a substrate dimethylallyldiphosphate. 

Fig. 7 is a profile of a reversed-phase TLC (developer: acetone/Water = 9/1) showing products formed by acting the 
present enzyme on a substrate geranyldiphosphate. 

Fig. 8 is a profile of a reversed phase TLC (developer: acetone/water = 9/1 ) showing products formed by acting the 
present enzyme on a substrate (all-E)-farnesyldiphosphate. 

Fig. 9 is a profile of a reversed phase TLC (developer: acetone/water = 9/1) showing products formed by acting the 
present enzyme on a substrate (all-E)-farnesyldiphosphate. 

DETAILED DESCRIPTION 

Genes of the present invention can be obtained by subjecting a gene coding for a farnesyldiphosphate synthase to 
mutagenesis, expressing the genes subjected to the mutagenesis, and selecting a gene providing a protein having ger- 
anylgeranyldiphosphate synthase activity. 

Genes coding for a farnesyldiphosphate synthase used in the present invention may be those of any origin. For 
example, farnesyldiphosphate synthases of E^cqL yeast, human, rat etc., as well as genes coding therefor are known, 
and amino acid sequences of these enzymes have high homology as shown in Fig. 2. Therefore, in addition to the gisne 
derived from BagillM^ stearothermophilus as described in detail, according to the present invention, any gene coding for 
an amino add sequence having a high homology, for example, at least 20% homology with the amino acid sequence of 
farnesyldiphosphate synthase derived from Bacillus stearothermophilus can be used regardless of its origin. As such 
gene sources, for example. Bacillus stearothermophilus. E.CQli, yeast, humans, rats etc. can be used. 

The gene to be mutated is an RNA or DNA coding for a farnesyldiphosphate synthase and sensitive to treatment 
with a mutagen, and DNA is preferably used for to ease of handling, and especially a single-stranded DNA is preferred 
due to its high mutation ratio. 

A single-stranded DNA can be easily prepared according to a conventional Procedure for preparing a single- 
stranded DNA, for example, by inserting a double-stranded DNA into a phage, introducing the phage into cqH cells, 
culturing the Ej. GQ& cells and recovering the phage from the resulting lysate solution; or by introducing a desired double- 
stranded DNA into host cells, infecting the host cells with helper phage, culturing the host cells and recovering the 
phage from the resulting lysate solution. 

Mutation of a gene can be carried out according to a conventional procedure for artificially mutating a gene. The 
mutation methods can be a physical method such as irradiation with X-rays, ultraviolet rays. etc.. a chemical method 
such as treatment with a mutagen, a method of cis incorporation by DNA polymerase, a method using synthetic oligo- 
nucleotides etc. A chemical method is preferable for ease of operation and a high mutation ratio. As a mutagen, a nitrite, 
such as sodium nitrite, or the like can be used. To mutate a single-stranded DNA, a nitrite is preferable. Mutagenesis is 
preferably carried out at a nitrite concentration of 0.01 to 2M. for example, at about 0.1 to 1 M. at a temperature of 20 to 
30**C, for 10 to 120 minutes. 

To select a gene coding for a protein having geranylgeranyldiphosphate synthase activity from the genes subjected 
to the mutagenesis, the gene subjected to the mutagenesis is inserted in an expression vector, the vector is introduced 
into host cells, the enzyme is expressed, and the expression product is tested for geranylgeranyldiphosphate synthase 
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activity. Geranylgeranyldiphosphate is converted to phytoene by a phytoene synthase, and the phytoene is converted 
to lycopene having red color by a phytoene desaturase. 

Accordingly, for example, a gene coding for a phytoene synthase and a gene coding for phytoene desaturase are 
inserted into an expression vector, the vector is introduced into host cells such as E. coli cells, and further an expression 

5 plasmid comprising a DNA to be tested is introduced into said host cells, and the double transformed host cells are cul- 
tured. If the gene to be tested encodes a geranylgeranyldiphosphate synthase, and the geranylgeranyldiphosphate pro- 
duced by the gene expression is converted to phytoene and further to lycopene. the cells are red-colored. Accordingly, 
a desired gene can be selected very easily and efficiently by selecting a red-colored colony. 

The present invention provides a protein having geranylgeranyldiphosphate synthase activity, i.e., a geranylgeran- 

10 yidiphosphate synthase, having an amino acid sequence modified from a native amino acid sequence of a farnesyldi- 
phosphate synthase. Here, the modification of an amino acid sequence means replacement of one or a few amino acids 
with other amino acids, deletion of one or a few amino acids or addition of one or a few amino acids, or a combination 
of these modifications. The amino acid replacement is especially preferable. Regarding the number of amino adds to 
be modified, "a few amino acids" means usually about 15 amino acids, preferably about 10 amino acids, and more pref- 

15 erably about 5 amino adds. Namely, according to the present invention, the number of mutated amino acids is about 1 
to 15. preferably about 1 to 10. and more preferably 1 to 5. 

To determine the positions of modified amino acids, after the mutagenesis and the selection of a gene coding for a 
geranylgeranyldiphosphate synthase, a nucleotide sequence of the selected gene is determined, and an amino acid 
sequence is predicted from the determined nudeotide sequence, the predicted amino acid sequence of the modified 

20 enzyme is composed with the corresponding native amino acid sequence. Amino acid sequences thus determined of 
the modified enzymes are shown in Fig. 4. 

In Fig. 4, the row indicated by the symbol WT shows, by the one-letter expression, a native amino add sequence 
of farnesyldlphosphate synthase of Bacillus stearothermophilus origin, and the rows Nos. 1 to 4 show representative 
amino acid sequences which acquired geranylgeranyldiphosphate synthase activity by amino add replacement in the 

25 amino acid sequence of the farnesyldiphosphate synthase, wherein only the amino acids different from the con-espond- 
ing amino acids in the native amino acid sequence of the farnesyldiphosphate synthase shown in the line TW are indi- 
cated by the one-letter expression of amino add. 

The modified enzyme No. 1 has two mutations, i.e., the 81st position (Tyr-^His) and 275th position (Leu->Ser); the 
modified enzyme No. 2 has two mutations, i.e., 34th position (Leu-Val) and 59th position (Arg->Gln); the modified 

30 enzyme No. 3 has two mutations, i.e.. 157th position (Val^Ala) and 182nd position (Hi&->Tyr); and the modified 
enzyme No. 4 has three mutations, i.e., 81st position (Tyr->His), 238th position (Pro^Arg) and 265th position (Afa^^ 
Thr). The amino acid sequences No. 1 to 4 of the above-mentioned modified enzymes and nucleotide sequences cod- 
ing therefor are shown in SEQ ID NO: 1 to 4, and the native amino acid sequence and a nucleotide sequence coding 
therefor is shown in SEQ ID NO: 5. ;>* 

35 In the present invention, the amino add sequence farnesyldiphosphate synthase of Bacillus stearothermophilus 
origin was used as a specific example. However, as shown in Figs. 2 and 3. farnesyldiphosphate synthases havejiigh 
homology among a wide spectrum of species covering those derived from the eukaryotes including humans andlho^e 
derived from prokaryotes including bacteria. Therefore, the present invention can be applied to enzymes derived from 
various species to obtain novel geranylgeranyldiphosphate synthase. 

40 As shown in Fig. 4. amino acid modification such as replacement occurs on the 34th. 59th, 81st, 157th. 182nd, 
239th. 265th, and/or 275th positions of farnesyldiphosphate of Bacillus stearothermophilus . For enzymes from other 
spedes, it is expected that replacement at positions corresponding to the above-mentioned positior^ of the farnesyldi- 
phosphate synthase of Bacillus stearothermophilus origin provides similar effects as that for the modified enzyme 
derived from BadllM? stearothermophilus . Therefore, the present invention can be applied to any farnesyldiphosphate 

45 synthases. 

The present invention also relates to genes coding for the various geranylgeranyldiphosphate synthases derived 
from a farnesyldiphosphate synthase. These genes can be obtained by mutation of a gene coding for a corresponding 
native amino acid sequence. In addition, once the position of mutated amino add is determined, a gene coding for the 
modified enzyme can be obtained by site-specific mutagenesis using a mutagenic primer. In addition, once an entire 
50 amino acid sequence is determined, a DNA coding for the amino acid sequence can be chemically synthesized accord- 
ing to a conventional procedure. 

Genes coding for farnesyldiphosphate synthases used as starting materials to obtain the present genes have been 
cloned from various organisms, and therefore they can be used. For example, a gene of Badilus stearothermophilus 
origin is described in J. Biochem. m. 355 - 363 (1993), a gene of E^ eoH origin is described in J. Biochem. Ifla 995 - 
55 1000 (1990), a gene of yeast origin is described in J.B.C. 2g4. 19176 - 19184 (1989), a gene of rat origin is described 
in Mol. Cell. Biol. 7, 3138 - 3146 (1987), and a gene of human origin is described in J.B.C. 265 . 4607 - 4614 (1990). 

The present invention further provides recombinant vectors, especially expression vectors, comprising the above- 
mentioned gene (DNA), recombinant host transformed with said vector, and a process for production of said enzyme 
using said recombinant host. 
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As an example, where cqH is used as a host, it is known that there are gene expression control mechanisms 
which regulate transcription of DNA to mRNA, translation of mRNA to protein etc. 

As promoter sequences which control the synthesis of mRNA. naturally occunring sequences such as lac, trp, bla. 
Ipp. PL, PR. tet. T3, T7 et al.. as well as mutants thereof, such as lacUVS, sequences prepared by fusing naturally 
occurring promoter sequences, such as tac. tra, etc. are known, and they can be used in the present invention. 

As sequences which control the ability to synthesize a protein from mRNA. it is known that a ritxjsome-binding site 
(GAGG and similar sequence) and the distance between the ribosome-binding site and the start codon ATG are impor- 
tant. In addition, it is known that a terminator which directs the termination of transcription at the 3*-end (for example, a 
vector comprising rrnBT1T2 is commercially available from Pharmacia) influences the efficiency of protein synthesis in 
a recombinant host 

As starting vectors to prepare recombinant vectors of the present invention, those commercially available can be 
used. Alternatively, various vectors derivatized according to a particular purpose can be used. For example. pBR322, 
pBR327. PKK223-2. pKK233-2. pTrc99A etc. containing a replicon derived from pMB1; pUClS. pUC19. pUCIIS, 
pUC119. PTV118N, PTV119N. pHSG298, pHSG396 etc.. which have been modified to increase copy number; 
PACYC177. pACYC184 etc. containing a replicon derived from p15A; as well as plasmids derived from pSC101, 
C01E1. R1 or F-factor, may be mentioned. 

Further, in addition to plasmids, viral vectors such as X phage. M13 phage etc.. and transposones can be used for 
introduction of a gene. These vectors are described in Molecular cloning (J. Sambrook. E.F. Fritsch. J. Maniatis, Cold 
Spring Harbor Laboratory Press); Cloning vector (P.H. Pouwels. B.E. Enger-Valk. W.J. Brammer. Elsevier); and cata- 
logs of manufacturers of vectors. 

Especially preferable is pTrc99 (commercially available for Pharmacia) which has an ampicillin resistance gene as 
a selective maker. Ptrc and lacF* as a promoter and control gene, an AGGA sequence as a ribosome-binding site and 
rrnBTlT2 as a terminator, and therefore has a function to control an expression of a geranylgeranyldiphosphate syn- 
thase. 

Introduction of a DNA coding for geranylgeranyldiphosphate synthase and if necessary DNA fragments having a 
function to control the expression of said gene into the above-mentioned vectors can be carried out using appropriate 
restriction enzymes and ligases according to a conventional procedure. 

Such a recombinant vector can be used to transform a microorganism such as Escherichia coli. Bacillus etc. Trans- 
formation can be carried out according to a conventional procedure, for exanrple by the CaCl2 method, protoplast 
method etc. described, for example, in Molecular cloning (J. Sambrook, E.F. Fritsch, T. Maniatis, Cold Spring Harbor 
Laboratory Press), DNA cloning Vol. I to III (D.M. Glover, IRLPRESS). 

Although methods for expression of the present gene in £. eoii was described in detail, according to the present 
invention, a DNA coding for a geranylgeranyldiphosphate synthase is inserted into a conventional expression vector 
according to a conventional procedure, and the vector Is used to transform a host, for example, prokaryotic cells such 
as various bacterial cells, lower eukaryotic cells for example single cell hosts, for example, yeast cells, or higher eukary- 
otic cells such as silk-worm. After transformation, the transformant is cultured to produce a geranylgeranyldiphosphate 
synthase, according to a conventional process. 

When a transformant host such as t con is cultured, geranylgeranyldiphosphate synthase is intracellularly accu- 
mulated. To recover the geranylgeranyldiphosphate from the cultured host cells, the cells are treated physiologically or 
chemically, for example, with a cell lysating agent to lyze the cells. The cell debris is removed, and the supernatant is 
subjected to an isolation process conventional for purification of enzymes. The above-mentioned cell-lysing enzyme is 
preferably lysozyme, and the physical treatment is preferably treatment with ultrasonic radiation. When the supernatant 
is heated to a temperature of about 55**C. proteins intrinsic to £.£Q!i are insolubilized and removed as an insoluble pre- 
cipitate. To purify the enzyme, gel-filtration chromatography, ion exchange chromatography, hydrophobic chromatogra- 
phy, reversed chromatography, and affinity chromatography can be used alone or in combination. During the purification 
and isolation steps, the desired enzyme can be stabilized by addition of a reducing agent such as dithiothreitol. protect- 
ing agent against proteases such as PMSF. BSA etc., metal ions such as magnesium, alone or in combination. 

The present invention further provides a process for production of geranylgeranyldiphosphate or geranylgeranyol. 
In this process, isopentenyldiphosphate. dimethylaltyldiphosphate. geranyldiphosphate, farnesyldiphosphate may be 
used as substrates. 

EXAMPt-gS 

Next, the present invention is explained in more detail by means of examples, though the present invention is not 
limited thereto. 
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Example 1. Construction of mutated genes (Fig. 5) 



The translation start codon in plasmid pFE15 (Japanese Unexamined Patent Publication (Kokai) No. 5-219761) 
containing a gene coding for fernesyldiphosphate synthase of Bacillus stearothermophilus origin was changed to ATG 
to obtain plasmid pEX1 1 (J. Biochem. 113, 355 - 363 (1993)) for overexpression of farnesyldiphosphate synthase, and 
the plasmid pEX1 1 was used In the following Examples. The mutation was carried out according to M. Myers et al. (Sci- 
ence, 229. 242 - 247 (1985)). 

First, a farnesyldiphosphate synthase gene present in Ncol-Hindlll fragment in pEX1 1 was removed, and inserted 
it into plasmid pTVI 18N (available from Takara Shuzo. Japan) to construct a plasmid, which was then introduced into 
CQli cells. The transformed eoli cells were cultured. With infection of a helper phage M13K07 (available from 
Takara Shuzo). pTVI 18N is converted to a single-stranded DNA and preferentially incorporated in phage particles and 
liberated out of cells. The culture was centrifuged to obtain a supernatant, from which the single-stranded DNA was 
recovered. 

The single-stranded DNA thus recovered was subjected to mutation with sodium nitrite (concentration 1 M or 0.2M) 
to introduce random mutation Into the single-stranded DNA, which was then restored to a double-stranded DNA using 
AMV reverse-transcriptase XL (E.C.2.7.7.7). This farnesyldiphosphate synthase gene fragment was introduced into 
pTrc99A (available for Pharmacia) and pTV118N. and resulting recombinant plasmids were used to transform B cdi 
into which a phytoene synthase gene and phytoene desaturase gene had been previously introduced, and red colonies 
were selected. The principle of the selection is as follows. 

The following screening method follows Ohnuma et al. (J. Biol. Chem.. gga. 14792 - 14797 (1994)). E SQli harbor- 
ing a plasmid pACYC-IB, into which crtB (phytoene synthase gene) and crtl (phytoene desaturase gene) of a phy- 
topathogen Emm. uredQVQra origin had been introduced, was transformed with the mutant plasmid. Note that at 
present it is believed that CQli does not have a geranylgeranykJiphosphate synthase. If the mutant plasmid encodes 
geranylgeranyldiphosphate synthase activity, lycopene having red color is produced in E^ ^ cells by pACYC-IB result- 
ing in formation of red-colored colonies. However, if the mutant plasmid does not encode geranylgeranyldiphosphate 
synthase activity, colonies are color-less. In this way, geranylgeranyldiphosphate synthase activity was easily detected 
by visual observation. 

As a result of transformation of the CQli cells with the mutant plasmid, red colonies were detected. The ratio of 
positive clones was 1 .32 x 10'^ (10 colonies per 7,600 colonies) when the mutation was carried out using 1 M NaN02, 
while the ratio of positive clones was 5.98 x 10'^ (one colony per 16.720 colonies) when the mutation was carried out 
using 0.2M NaNOa. revealing that the higher the concentration of NaNOa. the higher the positive ratio. 

Among the positive colonies, four colonies were selected, and a nucleotide sequence of an enzyme-coding region 
in the plasmid was determined, and an amino acid sequence encoded by the nucleotide sequence was determined, for 
each positive clone. The result is shown in SEQ ID NOs: 1 to 4. In addition, these amino acid sequences were com- 
pared with the native amino acid sequence, and positions of the mutation are shown in Fig. 4. 

Four mutated enzymes encoded by four mutant genes were further characterized. 

Example 2. Production of mutated enzvmes 

CQli transformed with the mutant plasmid was cultured in LB medium at 37*'C overnight. The culture was centri- 
fuged at 3,000 X G. at 4^0 for 5 minutes to collect cells, which were then suspended in a buffer for sonication (50 mM 
Tris-HCI (pH 7.0), 10 mM 2-mercaptoethanol, 1 mM EDTA). The suspension was subjected to ultrasonic waves to dis- 
rupt the cells. The sonicate was centrifuged at 5,000 x g, at a temperature of 4**C for 20 minutes, to obtain a superna- 
tant, which was then heated at 55**C for one hour to inactivate enzymes intrinsic to coH to obtain a crude enzyme 
extract. 
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To test the enzymatic activity of each mutant enzyme, reactions were can-led out in the following reaction mixture. 



Table 1 



[1-14C]IPP (1 Ci/mol)) 


25 nmol 


Allyl substrate (DMAPP. GPP, FPP) 


25 nmol 


MgCl2 


5 ^mol 


NH4C! 


50 \imo\ 


2-Mercaptoethanol 


50 ^mol 


TrIs-HCI buffer (pH 8.5) 


50 ^mol 


Sample to be tested 


proper quantity 


Total 


1 ml 


Note: 

DMAPP: Dimethylallyldiphosphate 
GPP: Geranyldiphosphate 
FPP: Farnesyldiphosphate 



The reaction mixture was incubated at 55*^0 for 30 minutes, and the product was extracted with water-saturated 1 - 
butanol, and radioactivity of the extract was counted by a liquid scintillation counter. In addition, the extract (butanol 
layer) was treated with an acid phosphatase and extracted with pentane. The extract was analyzed by TLC. The TLC 
analysis showed that the use of dimethylallyldiphosphate and geranyldiphosphate as an allyl substrate provides similar 
TLC patterns. Note that since the amount of each sample was adjusted so that the radioactivity is approximately same 
between the samples, the density of the band does not indicate specific activity 

The modified enzymes Nos. 1 and 4 produced an amount of geranylgeranyldiphosphate more than that of far- 
nesyldiphosphate. and therefore it is considered that the modified enzymes Nos. 1 and 4 are suitable for the production 
of geranylgeranyldiphosphate. On the other hand, the modified enzymes No. 2 and No, 3 provided a small amount of 
geranylgeranyldiphosphate. 

Where (all-E)-farnesyldiphosphate was used as a substrate (primer), (all-E)-geranylgeranyldiphosphate was 
formed. The results are shown in Figs. 6 to 9. 

Specific activity and ratio of product (GGOH/FOH) are shown in Table 2. 



Table 2 





Specific activity'' (nmol/min/mg protein) 


Ratio of product (GGPP/FPP) 


Wild type 


286 


0 


No. 1 


pTVIISN 


0.293 


18.4 




pTrc99A 


0.253 


6.28 


No. 2 


pTVIISN 


110 


2.95 X 10-2 




pTrc99A 


83 


2.54 X 10-2 


No. 3 


PTV118N 


143 


1.65 X lO "" 




pTrc99A 


19.7 


1.73x10-^ 


No. 4 


pTV118N 


0.262 


15.5 




pTrc99A 


0.271 


8.28 



*DMAPP was used as substrate. 
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SEQUENCE LISTING 
SEQ ID NO: 1 

5 

SEQUENCE LENGTH: 894 
SEQUENCE TYPE: Nucleic acid 
STRANDNESS : Double 
TOPOLOGY: Linear 
MOLECULAR TYPE: 

15 

SOURCE : Bacillus stearothermophilus 
CHARACTERISTIC: Mutant (1) of DNA coding for 
20 f arnesyldiphosphate synthase 

SEQUENCE 

ATG GCG GAG CTT TCA GTT GAA GAG TTT CTC AAC GAG CAA AAA GAG GCG 48 
Met Ala Gin Leu Ser Val Glu Gin Phe Leu Asn Glu Gin Lys Gin Ala 

5 10 15 

GTG GAA ACA GCG CTC TCC CGT TAT ATA GAG CGC TTA GAA GGG COG GCG 96 
Val Glu Thr Ala Leu Ser Arg Tyr lie Glu Arg Leu Glu Gly Pro Ala 

20 25 30 

AAG CTG AAA AAG GCG ATG GCG TAC TCA TTG GAG GCC GGC GGC AAA CGA 1A4 
Lys Leu Lys Lys Ala Met Ala Tyr Ser Leu Glu Ala Gly Gly Lys Arg 

35 AO - 45 

ATC CGT GCG TTG CTG CTT CTG TCC ACC GTT CGG GCG CTC GGA AAA GAC 192 
lie Arg Pro Leu Leu Leu Leu Ser Thr Val Arg Ala Leu Gly Lys Asp 

30 55 60 

CCG GCG GTC GGA TTG CCC GTC GCC TGC GCG ATT GAA ATG ATC CAT ACG 240 
Pro Ala Val Gly Leu Pro Val Ala Cys Ala lie Glu Met lie His Thr 
65 70 75 80 

CAC TCT TTG ATC CAT GAT GAT TTG CCG AGC ATG GAC AAC GAT GAT TTG 288 
His Ser Leu lie His Asp Asp Leu Pro Ser Met Asp Asn Asp Asp Leu 
85 90 95 



55 



30 



35 



40 



45 



SO 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



CGG CGC GGC AAG CCG ACG AAC CAT AAA GTG TTC GGC GAG GCG ATG GCC 336 
Arg Arg Gly Lys Pro Thr Asn His Lys Val Phe Gly Glu Ala Met Ala 

100 105 110 

ATG TTG GCG GGG GAG GGG TTG TTG ACG TAG GCG TTT CAA TTG ATC ACC 38 A 

lie Leu Ala Gly Asp Gly Leu Leu Thr Tyr Ala Phe Gin Leu lie Thr 

115 120 125 

GAA ATC GAC GAT GAG CGC ATC CCT CCT TCC GTC CGG CTT CGG CTC ATC 432 
Glu lie Asp Asp Glu Arg lie Pro Pro Ser Val Arg Leu Arg Leu lie 

130 135 140 

GAA CGG CTG GCG AAA GCG GCC GGT CCG GAA GGG ATG GTC GCC GGT GAG 480 
Glu Arg Leu Ala Lys Ala Ala Gly Pro Glu Gly Met Val Ala Gly Gin 
1A5 150 155 160 

GCA GCC GAT ATG GAA GGA GAG GGG AAA ACG CTG ACG CTT TCG GAG CTC 528 
Ala Ala Asp Met Glu Gly Glu Gly Lys Thr Leu Thr Leu Ser Glu Leu 

165 170 175 

GAA TAG ATT CAT CGG CAT AAA ACC GGG AAA ATG CTG CAA TAG AGC GTG 576 
Glu Tyr He His Arg His Lys Thr Gly Lys Met Leu Gin Tyr Ser Val 

180 185 190 

CAC GCC GGC GCC TTG ATC GGC GGC GCT GAT GCC CGG CAA ACG CGG GAG 624 
His Ala Gly Ala Leu He Gly Gly Ala Asp Ala Arg Gin Thr Arg Glu 

195 200 205 

CTT GAC GAA TTC GCC GCC CAT CTA GGC CTT GCC TTT CAA ATT CGC GAT 672 
Leu Asp Glu Phe Ala Ala His Leu Gly Leu Ala Phe Gin He Arg Asp 

210 215 220 

GAT ATT CTC GAT ATT GAA GGG GCA GAA GAA AAA ATC GGC AAG CCG GTC 720 
Asp lie Leu Asp He Glu Gly Ala Glu Glu Lys He Gly Lys Pro Val 
225 230 235 240 

GGC AGC GAC CAA AGC AAC AAC AAA GCG ACG TAT CCA GCG TTG CTG TCG 768. 
Gly Ser Asp Gin Ser Asn Asn Lys Ala Thr Tyr Pro Ala Leu Leu Ser 
245 250 255 
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10 



IS 



20 



25 



CTT GCC GGC GCG AAG GAA AAG TTG GCG TTC CAT ATC GAG GCG GCG GAG 816 
Leu Ala Gly Ala Lys Glu Lys Leu Ala Phe His lie Glu Ala Ala Gin 

260 265 270 

GGC CAT TCA CGG AAC GCC GAC GTT GAG GGC GCC GCG CTC GCC TAT ATT 864 
Arg His Ser Arg Asn Ala Asp Val Asp Gly Ala Ala Leu Ala Tyr lie 

275 280 285 

TGC GAA CTG GTC GCC GCC GGC GAC CAT TAA 894 
Cys Glu Leu Val Ala Ala Arg Asp His *** 

290 295 
SEQ ID NO: 2 

SEQUENCE LENGTH: 894 
SEQUENCE TYPE: Nucleic acid 
STRANDNESS: Double 
TOPOLOY: Linear 
MOLECULAR TYPE: 
30 SOURCE: Bacillus stearothermophilus 

CHARACTERISTIC: Mutant (2) of DNA coding for 
f arnesyldiphosphate systhase 
SEQUENCE 

ATG GCG CAG CTT TCA GTT GAA GAG TTT CTC AAC GAG CAA AAA GAG GCG A 8 

Met Ala Gin Leu Ser Val Glu Gin Phe Leu Asn Glu Gin Lys Gin Ala 

5 10 15 

GTG GAA ACA GCG CTC TCC CGT TAT ATA GAG GGC TTA GAA GGG GCG GCG 96 
45 Glu Thr Ala Leu Ser Arg Tyr lie Glu Arg Leu Glu Gly Pro Ala 

20 25 30 

AAG GTG AAA AAG GCG ATG GCG TAG TCA TTG GAG GCC GGC GGC AAA CGA 144 
Lys Val Lys Lys Ala Met Ala Tyr Ser Leu Glu Ala Gly Gly Lys Arg 
35 AO A5 



35 



40 



50 
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ATC CGT CCG TTG 
lie Arg Pro Leu 
50 

CCG GCG GTC GGA 
Pro Ala Val Gly 
65 

TAG TCT TTG ATC 
Tyr Ser Leu lie 

CGG CGC GGC AAG 
Arg Arg Gly Lys 
100 

ATC TTG GCG GGG 
lie Leu Ala Gly 
115 

GAA ATC GAC GAT 
Glu lie Asp Asp 
130 

GAA CGG CTG GCG 
Glu Arg Leu Ala 
1A5 

GCA GCC GAT ATG 
Ala Ala Asp Met 

GAA TAC ATT CAT 
Glu Tyr lie His 
180 

CAC GCC GGC GCC 
His Ala Gly Ala 
195 



CTG CTT CTG TCC 
Leu Leu Leu Ser 
55 

TTG CCC GTC GCC 
Leu Pro Val Ala 
70 

CAT GAT GAT TTG 
His Asp Asp Leu 
85 

CCG ACG AAC CAT 
Pro Thr Asn His 

GAC GGG TTG TTG 
Asp Gly Leu Leu 
120 

GAG CGC ATC CCT 
Glu Arg lie Pro 
135 

AAA GCG GCC GGT 
Lys Ala Ala Gly 
150 

GAA GGA GAG GGG 
Glu Gly Glu Gly 
165 

CGG CAT AAA ACC 
Arg His Lys Thr 

TTG ATC GGC GGC 
Leu lie Gly Gly 
200 



ACC GTT CAG GCG 
Thr Val Gin Ala 
60 

TGC GCG ATT GAA 
Cys Ala lie Glu 
75 

CCG AGC ATG GAC 
Pro Ser Met Asp 
90 

AAA GTG TTC GGC 
Lys Val Phe Gly 
105 

ACG TAC GCG TTT 
Thr Tyr Ala Phe 

CCT TCC GTC CGG 
Pro Ser Val Arg 
140 

CCG GAA GGG ATG 
Pro Glu Gly Met 
155 

AAA ACG CTG ACG 
Lys Thr Leu Thr 
170 

GGG AAA ATG CTG 
Gly Lys Met Leu 
185 

GCT GAT GCC CGG 
Ala Asp Ala Arg 



CTC GGC AAA GAC 
Leu Gly Lys Asp 

ATG ATC CAT ACG 
Met He His Thr 
80 

AAC GAT GAT TTG 
Asn Asp Asp Leu 
95 

GAG GCG ATG GCC 
Glu Ala Met Ala 
110 

CAA TTG ATC ACC 
Gin Leu He Thr 
125 

CTT CGG CTC ATC 
Leu Arg Leu He 

GTC GCC GGT CAG 
Val Ala Gly Gin 
160 

CTT TCG GAG CTC 
Leu Ser Glu Leu 
175 

CAA TAC AGC GTG 
Gin Tyr Ser Val 
190 

CAA ACG CGG GAG 
Gin Thr Arg Glu 
205 
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CTT GAC GAA TTC GCC GCC CAT CTA GGC CTT GCC TTT CAA ATT CGC GAT 672 
Leu Asp Glu Phe Ala Ala His Leu Gly Leu Ala Phe Gin lie Arg Asp 

5 

210 215 220 

GAT ATT CTC GAT ATT GAA GGG GCA GAA GAA AAA ATC GGC AAG CCG GTC 720 
Asp lie Leu Asp lie Glu Gly Ala Glu Glu Lys lie Gly Lys Pro Val 

10 

225 230 235 240 

GGC AGC GAC CAA AGC AAC AAC AAA GCG ACG TAT CCA GCG TTG CTG TCG 768 

Gly Ser Asp Gin Ser Asn Asn Lys Ala Thr Tyr Pro Ala Leu Leu Ser 

75 

245 250 255 

CTT GCC GGC GCG AAG GAA AAG TTG GCG TTC CAT ATC GAG GCG GCG CAG 816 
Leu Ala Gly Ala Lys Glu Lys Leu Ala Phe His lie Glu Ala Ala Gin 

20 

260 265 270 

CGC CAT TTA CGG AAC GCC GAC GTT GAC GGC GCC GCG CTC GCC TAT ATT 864 
Arg His Leu Arg Asn Ala Asp Val Asp Gly Ala Ala Leu Ala Tyr lie 

25 

275 280 285 

TGC GAA CTG GTC GCC GCC CGC GAC CAT TAA 894 
Cys Glu Leu Val Ala Ala Arg Asp His *** 

30 

290 295 
SEQ ID NO: 3 

35 SEQUENCE LENGTH: 894 

SEQUENCE TYPE: Nucleic acid 

STRANDNESS: Double 

40 

TOPOLOY: Linear 
MOLECULAR TYPE: 
45 SOURCE: Bacillus stearothermophilus 

CHARACTERISTIC: Mutant (3) of DNA coding for 
f arnesyldiphosphate systhase 

so 

SEQUENCE 
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ATG GCG GAG CTT TCA GTT GAA GAG TTT CTC AAC GAG CAA AAA GAG GCG 48 
Met Ala Gin Leu Ser Val Glu Gin Phe Leu Asn Glu Gin Lys Gin Ala 

5 10 15 

GTG GAA ACA GCG CTC TCC CGT TAT ATA GAG CGC TTA GAA GGG GCG GCG 96 
Val Glu Thr Ala Leu Ser Arg Tyr He Glu Arg Leu Glu Gly Pro^ Ala 

20 25 30 

AAG GTG AAA AAG GCG ATG GCG TAG TCA TTG GAG GCG GGC GGC AAA CGA 144 
Lys Leu Lys Lys Ala Met Ala Tyr Ser Leu Glu Ala Gly Gly Lys Arg 

35 40 45 

ATC CGT GCG TTG CTG CTT CTG TCC ACC GTT CGG GCG CTC GGA AAA GAG 192 
He Arg Pro Leu Leu Leu Leu Ser Thr Val Arg Ala Leu Gly Lys Asp 

50 55 60 

CGG GCG GTG GGA TTG CCC GTG GCG TGC GCG ATT GAA ATG ATC CAT ACG 240 
Pro Ala Val Gly Leu Pro Val Ala Cys Ala He Glu Met He His Thr 
65 70 75 80. 

TAG TCT TTG ATC CAT GAT GAT TTG CGG AGC ATG GAC AAC GAT GAT TTG 288 
Tyr Ser Leu He His Asp Asp Leu Pro Ser Met Asp Asn Asp Asp Leu 

85 90 95 

CGG CGC GGC AAG CCG ACG AAC CAT AAA GTG TTC GGC GAG GCG ATG GGC 336 
Arg Arg Gly Lys Pro Thr Asn His Lys Val Phe Gly Glu Ala Met Ala 

100 105 110 

ATC TTG GCG GGG GAC GGG TTG TTG ACG TAG GCG TTT CAA TTG ATC ACC 384 
He Leu Ala Gly Asp Gly Leu Leu Thr Tyr Ala Phe Gin Leu lie Thr 

115 120 125 

GAA ATC GAC GAT GAG CGC ATC CGT CCT TCC GTG CGG CTT CGG CTC ATC 43 2 

^ Glu He Asp Asp Glu Arg He Pro Pro Ser Val Arg Leu Arg Leu He 

130 135 140 

GAA CGG CTG GCG AAA GCG GCC GGT CCG GAA GGG ATG GCC GCG GGT CAG 480 . 

Glu Arg Leu Ala Lys Ala Ala Gly Pro Glu Gly Met Ala Ala Gly Gin 
1^5 150 155 160 
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25 



GCA GCC GAT ATG GAA GGA GAG GGG AAA ACG CTG ACG CTT TOG GAG CTC 528 
Ala Ala Asp Met Glu Gly Glu Gly Lys Thr Leu Thr Leu Ser Glu Leu 
^ 165 170 175 

GAA TAG ATT CAT GGG TAT AAA ACC GGG AAA ATG CTG CAA TAC AGC GTG 576 
Glu Tyr lie His Arg Tyr Lys Thr Gly Lys Met Leu Gin Tyr Ser Val 
10 180 185 190 

CAC GCC GGC GCC TTG ATC GGC GGC GCT GAT GCC CGG CAA ACG CGG GAG 62 A 

His Ala Gly Ala Leu lie Gly Gly Ala Asp Ala Arg Gin Thr Arg Glu 
15 195 200 205 

CTT GAC GAA TTC GCC GCC CAT CTA GGC CTT GCC TTT CAA ATT CGC GAT 672 
Leu Asp Glu Phe Ala Ala His Leu Gly Leu Ala Phe Gin lie Arg Asp 

210 215 220 

GAT ATT CTC GAT ATT GAA GGG GCA GAA GAA AAA ATC GGC AAG CCG GTC 720 
Asp lie Leu Asp lie Glu Gly Ala Glu Glu Lys lie Gly Lys Pro Val 
225 230 235 240 

GGC AGC GAC CAA AGC AAC AAC AAA GCG ACG TAT CCA GCG TTG CTG TCG 768 
Gly Ser Asp Gin Ser Asn Asn Lys Ala Thr Tyr Pro Ala Leu Leu Ser 

245 250 255 

CTT GCC GGC GCA AAG GAA AAG TTG GCG TTC CAT ATC GAG GCG GCG GAG 816 
Leu Ala Gly Ala Lys Glu Lys Leu Ala Phe His lie Glu Ala Ala Gin 

260 265 270 

CGC CAT TTA CGG AAC GCC GAC GTT GAC GGC GCC GCG CTC GCC TAT ATT 864 
Arg His Leu Arg Asn Ala Asp Val Asp Gly Ala Ala Leu Ala Tyr lie 

275 280 285 

TGC GAA CTG GTC GCC GCC CGC GAC CAT TAA 894 
Cys Glu Leu Val Ala Ala Arg Asp His *** 

290 295 
SEQ ID NO: 4 
SEQUENCE LENGTH: 894 
SEQUENCE TYPE: Nucleic acid 

so 

STRANDNESS: Double 
TOPOLOY: Linear 
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MOLECUIiAR TYPE: 

SOURCE : Bacillus stearothermophilus 
CHARACTERISTIC: Mutant (4) of DNA coding for 
f arnesyldiphosphate systhase 
SEQUENCE 

ATG GCG CAG CTT TCA GTT GAA CAG TTT CTC AAC GAG CAA AAA CAG GCG 48 
Met Ala Gin Leu Ser Val Glu Gin Phe Leu Asn Glu Gin Lys Gin Ala 

5 10 15 

GTG GAA ACA GCG CTC TCC CGT TAT ATA GAG CGC TTA GAA GGG CCG GCG 96 
Val Glu Thr Ala Leu Ser Arg Tyr lie Glu Arg Leu Glu Gly Pro Ala 

20 25 30 

AAG CTG AAA AAG GCG ATG GCG TAC TCA TTG GAG GCC GGC GGC AAA CGA 144 
Lys Leu Lys Lys Ala Met Ala Tyr Ser Leu Glu Ala Gly Gly Lys Arg 

35 40 45 

ATC CGT CCG TTG CTG CTT CTG TCC ACC GTT CGG GCG CTC GGC AAA GAC 192 
lie Arg Pro Leu Leu Leu Leu Ser Thr Val Arg Ala Leu Gly Lys Asp 

50 55 60 

CCG GCG GTC GGA TTG CCC GTC GCC TGC GCG ATT GAA ATG ATC CAT ACG 240 
Pro Ala Val Gly Leu Pro Val Ala Cys Ala He Glu Met He His Thr 
65 70 75 80 

35 GAC TCT TTG ATC CAT GAT GAT TTG CCG AGC ATG GAC AAC GAT GAT TTG 288 

His Ser Leu He His Asp Asp Leu Pro Ser Met Asp Asn Asp Asp Leu 
85 90 95 

40 CGG CGC GGC AAG CCG ACG AAC CAT AAA GTG TTC GGC GAG GCG ATG GCC 336 

Arg Arg Gly Lys Pro Thr Asn His Lys Val Phe Gly Glu Ala Met Ala 

100 105 110 

ATC TTG GCG GGG GAC GGG TTG TTG ACG TAC GCG TTT CAA TTG ATC ACC 384. 
He Leu Ala Gly Asp Gly Leu Leu Thr Tyr Ala Phe Gin Leu He Thr 

115 , 120 125 

GAA ATC GAC GAT GAG CGC ATC CCT CCT TCC GTC CGG CTT CGG CTC ATC 432 
Glu He Asp Asp Glu Arg He Pro Pro Ser Val Arg Leu Arg Leu He 
130 135 140 
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GAA CGG GTG GCG AAA GCG GCC GGT CCG GAA GGG ATG GTC GCC GGT GAG £^80 
Glu Arg Leu Ala Lys Ala Ala Gly Pro Glu Gly Met Val Ala Gly Gin 
^ 1A5 150 155 160 

GCA GCC GAT ATG GAA GGA GAG GGG AAA ACG CTG ACG CTT TCG GAG CTC 528 
Ala Ala Asp Met Glu Gly Glu Gly Lys Thr Leu Thr Leu Ser Glu Leu 

165 170 175 

GAA TAG ATT CAT CGG CAT AAA ACC GGG AAA ATG CTG CAA TAG AGC GTG 576 
Glu Tyr lie His Arg His Lys Thr Gly Lys Met Leu Gin Tyr Ser Val 
75 180 185 190 

CAC GCC GGC GCC TTG ATC GGC GGC GCT GAT GCC CGG CAA ACG CGG GAG 624 
His Ala Gly Ala Leu lie Gly Gly Ala Asp Ala Arg Gin Thr Arg Glu 
20 195 200 205 

CTT GAG GAA TTC GCC GCC CAT CTA GGC CTT GCC TTT CAA ATT CGG GAT 672 
Leu Asp Glu Phe Ala Ala His Leu Gly Leu Ala Fhe Gin lie Arg Asp 

210 215 220 

GAT ATT CTC GAT ATT GAA GGG GCA GAA GAA AAA ATC GGC AAG CGG GTC 720 
Asp lie Leu Asp lie Glu Gly Ala Glu Glu Lys lie Gly Lys Arg Val 
225 230 235 240 

GGC AGC GAG CAA AGC AAC AAC AAA GCG ACG TAT CCA GCG TTG CTG TCG 768 
Gly Ser Asp Gin Ser Asn Asn Lys Ala Thr Tyr Pro Ala Leu Leu Ser 

245 250 255 

CTT GCC GGC GCG AAG GAA AAG TTG ACG TTC CAT AJC GAG GCG GCG GAG 816 
Leu Ala Gly Ala Lys Glu Lys Leu Thr Phe His lie Glu Ala Ala Gin 

260 265 270 

CGC CAT TTA CGG AAC GCC GAC GTT GAC GGC GCC GCG CTC GCC TAT ATT 864 
Arg His Leu Arg Asn Ala Asp Val Asp Gly Ala Ala Leu Ala Tyr lie 

275 280 285 

TGC GAA CTG GTC GCC GCC CGC GAC CAT TAA 894- 
Cys Glu Leu Val Ala Ala Arg Asp His *** 

290 295 
SEQ ID NO: 5 
SEQUENCE LENGTH: 894 
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SEQUENCE TYPE: Nucleic acid 
STRANDNESS : Double 
TOPOLOY: Linear 
MOLECULAR TYPE: 

SOURCE : Bacillus s tearothermophilus 

CHARACTERISTIC: DNA coding for native f arnesyldiphosphate 

synthase 

SEQUENCE 

ATG GCG CAG CTT TCA GTT GAA GAG TTT CTC AAC GAG CAA AAA GAG GCG 48 
Met Ala Gin Leu Ser Val Glu Gin Phe Leu Asn Glu Gin Lys Gin Ala 

5 10 15 

GTG GAA ACA GCG CTC TCC CGT TAT ATA GAG CGC TTA GAA GGG CCG GCG 96 
Val Glu Thr Ala Leu Ser Arg Tyr lie Glu Arg Leu Glu Gly Pro Ala 

20 25 30 

AAG CTG AAA AAG GCG ATG GCG TAG TCA TTG GAG GCC GGC GGC AAA CGA 144 
Lys Lys Lys Lys Ala Met Ala Tyr Ser Leu Glu Ala Gly Gly Lys Arg 

35 40 45 

ATC CGT CCG TTG CTG CTT CTG TCC ACC GTT CAG GCG CTC GGC AAA GAC 192 
lie Arg Pro Leu Leu Leu Leu Ser Thr Val Gin Ala Leu Gly Lys Asp 

50 55 60 

CCG GCG GTC GGA TTG CCC GTC GCC TGC GCG ATT GAA ATG ATC CAT ACG 240 
Pro Ala Val Gly Leu Pro Val Ala Cys Ala He Glu Met He His Thr 
65 70 75 80 

TAC TCT TTG ATC CAT GAT GAT TTG CCG AGC ATG GAC AAC GAT GAT TTG 288 
Tyr Ser Leu He His Asp Asp Leu Pro Ser Met Asp Asn Asp Asp Leu 

85 90 95 

CGG CGC GGC AAG CCG ACG AAC CAT AAA GTG TTC GGC GAG GCG ATG GCC 336 
Arg Arg Gly Lys Pro Thr Asn His Lys Val Phe Gly Glu Ala Met Ala 
100 105 110 
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ATC TTG GCG GGG 
lie Leu Ala Gly 
115 

GAA ATC GAG GAT 
Glu lie Asp Asp 
130 

GAA CGG CTG GCG 
Glu Arg Leu Ala 
1A5 

GCA GCC GAT ATG 
Ala Ala Asp Met 

GAA TAG ATT CAT 
Glu Tyr He His 
180 

CAC GCC GGG GCC 
His Ala Gly Ala 
195 

CTT GAC GAA TTC 
Leu Asp Glu Phe 
210 

GAT ATT CTC GAT 
Asp He Leu Asp 
225 

GGC AGC GAC CAA 
Gly Ser Asp Gin 

CTT GCC GGC GCG 
Leu Ala Gly Ala 
260 

CGC CAT TTA CGG 
Arg His Leu Arg 
275 



GAC GGG TTG TTG 
Asp Gly Leu Leu 
120 

GAG CGC ATC CCT 
Glu Arg He Pro 
135 

AAA GCG GCC GGT 
Lys Ala Ala Gly 
150 

GAA GGA GAG GGG 
Glu Gly Glu Gly 
165 

CGG CAT AAA ACC 
Arg His Lys Thr 

TTG ATC GGC GGC 
Leu He Gly Gly 
200 

GCC GCC CAT CTA 
Ala Ala His Leu 

215 

ATT GAA GGG GCA 
He Glu Gly Ala 
230 

AGC AAC AAC AAA 
Ser Asn Asn Lys 
245 

AAG GAA AAG TTG 
Lys Glu Lys Leu 

AAC GCC GAC GTT 
Asn Ala Asp Val 
280 



ACG TAG GCG TTT 
Thr Tyr Ala Phe 

CCT TCC GTC CGG 
Pro Ser Val Arg 
140 

CGG GAA GGG ATG 
Pro Glu Gly Met 
155 

AAA ACG CTG ACG 
Lys Thr Leu Thr 
170 

GGG AAA ATG CTG 
Gly Lys Met Leu 
185 

GCT GAT GCC CGG 
Ala Asp Ala Arg 

GGC CTT GCC TTT 
Gly Leu Ala Phe 
220 

GAA GAA AAA ATC 
Glu Glu Lys He 
235 

GCG ACG TAT CCA 
Ala Thr Tyr Pro 
250 

GCG TTC CAT ATC 
Ala Phe His He 
265 

GAC GGC GCC GCG 
Asp Gly Ala Ala 



CAA TTG ATC ACC 
Gin Leu He Thr 
125 

CTT CGG CTC ATC 
Leu Arg Leu He 

GTC GCC GGT GAG 
Val Ala Gly Gin 
160 

CTT TCG GAG CTC 
Leu Ser Glu Leu 
175 

CAA TAG AGC GTG 
Gin Tyr Ser Val 
190 

CAA ACG CGG GAG 
Gin Thr Arg Glu 
205 

CAA ATT CGC GAT 
Gin He Arg Asp 

GGC AAG CGG GTC 
Gly Lys Pro Val 
240 

GCG TTG CTG TCG 
Ala Leu Leu Ser 
255 

GAG GCG GCG GAG 
Glu Ala Ala Gin 
270 

CTC GCC TAT ATT 
Leu Ala Tyr He 
285 
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TGC GAA CTG GTC GCC GCC CGC GAC CAT TAA 
Cys Glu Leu Val Ala Ala Arg Asp His *** 
290 295 



894 



A mutated farnesyldiphosphate synthase capable of synthesizing geranylgeranyldiphosphate and gene coding for 
said mutated enzyme, wherein the mutated enzyme is modified from a native farnesyldiphosphate synthase by muta- 
tion of a gene coding for a native farnesyldiphosphate synthase. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Toyota Jidosha Kabushiki Kaisha 

(B) STREET: 1, Toyota -cho 

(C) CITY: Toyota-shi 

(D) STATE: Aichi 

(E) COUNTRY: Japan 

(F) POSTAL CODE (ZIP) : None 

(ii) TITLE OF INVENTION: MUTATED FARNESYLD I PHOSPHATE SYNTHASE CAPABLE 
OF SYNTHESIZING GERANYLGERANYL DIPHOSPHATE AND GENE CODING 
THEREFOR 

(iii) NUMBER OF SEQUENCES: 10 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 (EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: EP 95115423.6 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus stearothermophilus 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..894 

(D) OTHER INFORMATION: /function^ "Mutant (1) of DNA 
coding for farnesyldiphosphate synthase* 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

ATG GCG CAG CTT TCA GTT GAA CAG TTT CTC AAC GAG CAA AAA CAG GCG 48 

Met Ala Gin Leu Ser Val Glu Gin Phe Leu Asn Glu Gin Lys Gin Ala 
1 5 10 15 

GTG GAA ACA GCG CTC TCC CGT TAT ATA GAG CGC TTA GAA GGG CCG GCG 96 

Val Glu Thr Ala Leu Ser Arg Tyr lie Glu Arg Leu Glu Gly Pro Ala 
20 25 30 
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AAG CTG AAA AAG GCG ATG GCG TAC TCA TTG GAG GCC GGC GGC AAA CGA 
Lys Leu Lys Lys Ala Met Ala Tyr Ser Leu Glu Ala Gly Gly Lys Arg 
35 40 45 

ATC CGT CCG TTG CTG CTT CTG TCC ACC GTT CGG GCG CTC GGA AAA GAC 
lie Arg Pro Leu Leu Leu Leu Ser Thr Val Arg Ala Leu Gly Lys Asp 
50 55 60 

CCG GCG GTC GGA TTG CCC GTC GCC TGC GCG ATT GAA ATG ATC CAT ACG 
Pro Ala Val Gly Leu Pro Val Ala Cys Ala He Glu Met He His Thr 
65 70 75 80 

CAC TCT TTG ATC CAT GAT GAT TTG CCG AGC ATG GAC AAC GAT GAT TTG 
His Ser Leu He His Asp Asp Leu Pro Ser Met Asp Asn Asp Asp Leu 
85 90 95 

CGG CGC GGC AAG CCG ACG AAC CAT AAA GTG TTC GGC GAG GCG ATG GCC 
Arg Arg Gly Lys Pro Thr Asn His Lys Val Phe Gly Glu Ala Met Ala 
100 105 110 

ATC TTG GCG GGG GAC GGG TTG TTG ACG TAC GCG TTT CAA TTG ATC ACC 
He Leu Ala Gly Asp Gly Leu Leu Thr Tyr Ala Phe Gin Leu He Thr 
115 120 125 

GAA ATC GAC GAT GAG CGC ATC CCT CCT TCC GTC CGG CTT CGG CTC ATC 
Glu He Asp Asp Glu Arg He Pro Pro Ser Val Arg Leu Arg Leu He 
130 135 140 

GAA CGG CTG GCG AAA GCG GCC GGT CCG GAA GGG ATG GTC GCC GGT CAG 
Glu Arg Leu Ala Lys Ala Ala Gly Pro Glu Gly Met Val Ala Gly Gin 
145 150 155 160 

GCA GCC GAT ATG GAA GGA GAG GGG AAA ACG CTG ACG CTT TCG GAG CTC 
Ala Ala Asp Met Glu Gly Glu Gly Lys Thr Leu Thr Leu Ser Glu Leu 
165 170 175 

GAA TAC ATT CAT CGG CAT AAA ACC GGG AAA ATG CTG CAA TAC AGC GTG 
Glu Tyr He His Arg His Lys Thr Gly Lys Met Leu Gin Tyr Ser Val 
180 185 190 

CAC GCC GGC GCC TTG ATC GGC GGC GCT GAT GCC CGG CAA ACG CGG GAG 
His Ala Gly Ala Leu He Gly Gly Ala Asp Ala Arg Gin Thr Arg Glu 
195 200 205 

CTT GAC GAA TTC GCC GCC CAT CTA GGC CTT GCC TTT CAA ATT CGC GAT 
Leu Asp Glu Phe Ala Ala His Leu Gly Leu Ala Phe Gin He Arg Asp 
210 215 220 

GAT ATT CTC GAT ATT GAA GGG GCA GAA GAA AAA ATC GGC AAG CCG GTC 
Asp He Leu Asp He Glu Gly Ala Glu Glu Lys He Gly Lys Pro Val 
225 230 235 240 

GGC AGC GAC CAA AGC AAC AAC AAA GCG ACG TAT CCA GCG TTG CTG TCG 
Gly Ser Asp Gin Ser Asn Asn Lys Ala Thr Tyr Pro Ala Leu Leu Ser 
245 250 255 
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CTT GCC GGC GCG AAG GAA AAG TTG GCG TTC CAT ATC GAG GCG GCG CAG 
Leu Ala Gly Ala Lys Glu Lys Leu Ala Phe His lie Glu Ala Ala Gin 
260 265 270 

CGC CAT TCA CGG AAC GCC GAC GTT GAC GGC GCC GCG CTC GCC TAT ATT 
Arg His Ser Arg Asn Ala Asp Val Asp Gly Ala Ala Leu Ala Tyr lie 
275 280 285 



TGC GAA CTG GTC GCC GCC CGC GAC CAT TAA 
Cys Glu Leu Val Ala Ala Arg Asp His 
290 295 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 297 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Gin Leu Ser Val Glu Gin Phe Leu Asn Glu Gin Lys Gin Ala 
15 10 15 

Val Glu Thr Ala Leu Ser Arg Tyr He Glu Arg Leu Glu Gly Pro Ala 
20 25 30 

Lys Leu Lys Lys Ala Met Ala Tyr Ser Leu Glu Ala Gly Gly Lys Arg 
35 40 45 

He Arg Pro Leu Leu Leu Leu Ser Thr Val Arg Ala Leu Gly Lys Asp 
50 55 60 

Pro Ala Val Gly Leu Pro Val Ala Cys Ala He Glu Met He His Thr 
65 70 75 80 

His Ser Leu He His Asp Asp Leu Pro Ser Met Asp Asn Asp Asp Leu 
85 90 95 

Arg Arg Gly Lys Pro Thr Asn His Lys Val Phe Gly Glu Ala Met Ala 
100 105 110 

He Leu Ala Gly Asp Gly Leu Leu Thr Tyr Ala Phe Gin Leu He Thr 
115 120 125 

Glu He Asp Asp Glu Arg He Pro Pro Ser Val Arg Leu Arg Leu lie 
130 135 140 

Glu Arg Leu Ala Lys Ala Ala Gly Pro Glu Gly Met Val Ala Gly Gin 
145 150 155 160 

Ala Ala Asp Met Glu Gly Glu Gly Lys Thr Leu Thr Leu Ser Glu Leu 
165 170 175 

Glu Tyr He His Arg His Lys Thr Gly Lys Met Leu Gin Tyr Ser Val 
ISO 185 190 

His Ala Gly Ala Leu lie Gly Gly Ala Asp Ala Arg Gin Thr Arg Glu 
195 200 205 

Leu Asp Glu Phe Ala Ala His Leu Gly Leu Ala Phe Gin He Arg Asp 
210 215 220 

Asp He Leu Asp He Glu Gly Ala Glu Glu Lys He Gly Lys Pro Val 
225 230 235 240 

Gly Ser Asp Gin Ser Asn Asn Lys Ala Thr Tyr Pro Ala Leu Leu Ser 
245 250 255 
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Leu Ala Gly Ala Lys Glu Lys Leu Ala Phe His lie Glu Ala Ala Gin 
260 265 270 

Arg His Ser Arg Asn Ala Asp Val Asp Gly Ala Ala Leu Ala Tyr lie 
275 280 285 

Cys Glu Leu Val Ala Ala Arg Asp His 
290 295 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



10 



15 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus stearothermophilus 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..894 

(D) OTHER INFORMATION: /function- "Mutant (2) of DNA 
coding for f arnesyldiphosphate synthase" 



20 



26 



35 



40 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG GCG CAG CTT TCA GTT GAA CAG TTT CTC AAC GAG CAA AAA CAG GCG 48 

Met Ala Gin Leu Ser Val Glu Gin Phe Leu Asn Glu Gin Lys Gin Ala 

1 5 10 15 

GTG GAA ACA GCG CTC TCC CGT TAT ATA GAG CGC TTA GAA GGG CCG GCG 96 

Val Glu Thr Ala Leu Ser Arg Tyr lie Glu Arg Leu Glu Gly Pro Ala 
20 25 30 



AAG GTG AAA AAG GCG ATG GCG TAG TCA TTG GAG GCC GGC GGC AAA CGA 144 
Lys Val Lys Lys Ala Met Ala Tyr Ser Leu Glu Ala Gly Gly Lys Arg 

30 35 40 45 

ATC CGT CCG TTG CTG CTT CTG TCC ACC GTT CAG GCG CTC GGC AAA GAC 192 
lie Arg Pro Leu Leu Leu Leu Ser Thr Val Gin Ala Leu Gly Lys Asp 

50 55 60 



CCG GCG GTC GGA TTG CCC GTC GCC TGC GCG ATT GAA ATG ATC CAT ACG 240 
Pro Ala Val Gly Leu Pro Val Ala Cys Ala He Glu Met He His Thr 
65 70 75 80 

TAC TCT TTG ATC CAT GAT GAT TTG CCG AGC ATG GAC AAC GAT GAT TTG 288 
Tyr Ser Leu He His Asp Asp Leu Pro Ser Met Asp Asn Asp Asp Leu 
85 90 95 

CGG CGC GGC AAG CCG ACG AAC CAT AAA GTG TTC GGC GAG GCG ATG GCC 336 
Arg Arg Gly Lys Pro Thr Asn His Lys Val Phe Gly Glu Ala Met Ala 
100 105 110 

ATC TTG GCG GGG GAC GGG TTG TTG ACG TAC GCG TTT CAA TTG ATC ACC 384 
He Leu Ala Gly Asp Gly Leu Leu Thr Tyr Ala Phe Gin Leu He Thr 
115 120 125 

GAA ATC GAC GAT GAG CGC ATC CCT CCT TCC GTC CGG CTT CGG CTC ATC 432 
Glu He Asp Asp Glu Arg He Pro Pro Ser Val Arg Leu Arg Leu He 
130 135 140 
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10 



15 



GAA CGG CTG GCG AAA GCG GCC GGT CCG GAA GGG ATG GTC GCC GGT CAG 480 
Glu Arg Leu Ala Lys Ala Ala Gly Pro Glu Gly Met Val Ala Gly Gin 
145 150 155 160 

GCA GCC GAT ATG GAA GGA GAG GGG AAA ACG CTG ACG CTT TCG GAG CTC 528 
Ala Ala Asp Met Glu Gly Glu Gly Lys Thr Leu Thr Leu Ser Glu Leu 
165 170 175 

GAA TAC ATT CAT CGG CAT AAA ACC GGG AAA ATG CTG CAA TAG AGC GTG 576 
Glu Tyr lie His Arg His Lys Thr Gly Lys Met Leu Gin Tyr Ser Val 
180 185 190 

CAC GCC GGC GCC TTG ATC GGC GGC GCT GAT GCC CGG CAA ACG CGG GAG 624 
His Ala Gly Ala Leu lie Gly Gly Ala Asp Ala Arg Gin Thr Arg Glu 
195 200 205 



CTT GAC GAA TTC GCC GCC CAT CTA GGC CTT GCC TTT CAA ATT CGC GAT 672 
Leu Asp Glu Phe Ala Ala His Leu Gly Leu Ala Phe Gin He Arg Asp 
20 210 215 220 

GAT ATT CTC GAT ATT GAA GGG GCA GAA GAA AAA ATC GGC AAG CCG GTC 720 
Asp He Leu Asp He Glu Gly Ala Glu Glu Lys He Gly Lys Pro Val 
225 230 235 240 

25 

GGC AGC GAC CAA AGC AAC AAC AAA GCG ACG TAT CCA GCG TTG CTG TCG 768 
Gly Ser Asp Gin Ser Asn Asn Lys Ala Thr lyr Pro Ala Leu Leu Ser 
245 250 255 

30 CTT GCC GGC GCG AAG GAA AAG TTG GCG TTC CAT ATC GAG GCG GCG CAG 816 

Leu Ala Gly Ala Lys Glu Lys Leu Ala Phe His He Glu Ala Ala Gin 
260 265 270 

CGC CAT TTA CGG AAC GCC GAC GTT GAC GGC GCC GCG CTC GCC TAT ATT 864 
35 Arg His Leu Arg Asn Ala Asp Val Asp Gly Ala Ala Leu Ala Tyr He 

275 280 285 

TGC GAA CTG GTC GCC GCC CGC GAC CAT TAA 894 
Cys Glu Leu Val Ala Ala Arg Asp His 
40 290 295 



45 



50 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 297 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ala Gin Leu Ser Val Glu Gin Phe Leu Asn Glu Gin Lys Gin Ala 
1 5 10 15 

Val Glu Thr Ala Leu Ser Arg Tyr lie Glu Arg Leu Glu Gly Pro Ala 
20 25 30 

Lys Val Lys Lys Ala Met Ala Tyr Ser Leu Glu Ala Gly Gly Lys Arg 
35 40 45 . 

lie Arg Pro Leu Leu Leu Leu Ser Thr Val Gin Ala Leu Gly Lys Asp 
50 55 60 

Pro Ala Val Gly Leu Pro Val Ala Cys Ala lie Glu Met lie His Thr 
65 70 75 80 

Tyr Ser Leu lie His Asp Asp Leu Pro Ser Met Asp Asn Asp Asp Leu 
85 90 95 

Arg Arg Gly Lys Pro Thr Asn His Lys Val Phe Gly Glu Ala Met Ala 
100 105 110 

lie Leu Ala Gly Asp Gly Leu Leu Thr Tyr Ala Phe Gin Leu He Thr 
115 120 125 

Glu He Asp Asp Glu Arg He Pro Pro Ser Val Arg Leu Arg Leu He 
130 135 140 

Glu Arg Leu Ala Lys Ala Ala Gly Pro Glu Gly Met Val Ala Gly Gin 
145 150 155 160 

Ala Ala Asp Met Glu Gly Glu Gly Lys Thr Leu Thr Leu Ser Glu Leu 
165 170 175 

Glu Tyr He His Arg His Lys Thr Gly Lys Met Leu Gin Tyr Ser Val 
180 185 190 

His Ala Gly Ala Leu He Gly Gly Ala Asp Ala Arg Gin Thr Arg Glu 
195 200 205 

Leu Asp Glu Phe Ala Ala His Leu Gly Leu Ala Phe Gin He Arg Asp 
210 215 220 

Asp He Leu Asp He Glu Gly Ala Glu Glu Lys He Gly Lys Pro Val 
225 230 235 240 

Gly Ser Asp Gin Ser Asn Asn Lys Ala Thr Tyr Pro Ala Leu Leu Ser 
245 250 255 
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Leu Ala Gly Ala Lys Glu Lys Leu Ala Phe His lie Glu Ala Ala Gin 
260 265 270 

^ Arg His Leu Arg Asn Ala Asp Val Asp Gly Ala Ala Leu Ala Tyr lie. 

275 280 285 

Cys Glu Leu Val Ala Ala Arg Asp His 
290 295 

10 



15 



20 



25 



30 



35 



40 



45 



50 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus stearothermophilus 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..894 

(D) OTHER INFORMATION: /function* "Mutant (3) of DNA 

coding for farnesyldiphosphate synthase" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG GCG CAG CTT TCA GTT GAA CAG TTT CTC AAC GAG CAA AAA CAG GCG 
Met Ala Gin Leu Ser Val Glu Gin Phe Leu Asn Glu Gin Lys Gin Ala 
15 10 15- 

GTG GAA ACA GCG CTC TCC CGT TAT ATA GAG CGC TTA GAA GGG CCG GCG 
Val Glu Thr Ala Leu Ser Arg Tyr lie Glu Arg Leu Glu Gly Pro Ala 
20 25 30 

AAG CTG AAA AAG GCG ATG GCG TAC TCA TTG GAG GCC GGC GGC AAA CGA 
Lys Leu Lys Lys Ala Met Ala Tyr Ser Leu Glu Ala Gly Gly Lys Arg 
35 40 45 

ATC CGT CCG TTG CTG CTT CTG TCC ACC GTT CGG GCG CTC GGA AAA GAC 
lie Arg Pro Leu Leu Leu Leu Ser Thr Val Arg Ala Leu Gly Lys Asp 
50 55 60 

CCG GCG GTC GGA TTG CCC GTC GCC TGC GCG ATT GAA ATG ATC CAT ACG 
Pro Ala Val Gly Leu Pro Val Ala Cys Ala lie Glu Met lie His Thr 
65 70 75 80 

TAC TCT TTG ATC CAT GAT GAT TTG CCG AGC ATG GAC AAC GAT GAT TTG 
Tyr Ser Leu lie His Asp Asp Leu Pro Ser Met Asp Asn Asp Asp Leu 
85 90 95 

CGG CGC GGC AAG CCG ACG AAC CAT AAA GTG TTC GGC GAG GCG ATG GCC 
Arg Arg Gly Lys Pro Thr Asn His Lys Val Phe Gly Glu Ala Met Ala 
100 105 110 

ATC TTG GCG GGG GAC GGG TTG TTG ACG TAC GCG TTT CAA TTG ATC ACC 
He Leu Ala Gly Asp Gly Leu Leu Thr Tyr Ala Phe Gin Leu He Thr 
115 120 125 

GAA ATC GAC GAT GAG CGC ATC CCT CCT TCC GTC CGG CTT CGG CTC ATC 
Glu He Asp Asp Glu Arg He Pro Pro Ser Val Arg Leu Arg Leu He 
130 135 140 
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GAA CGG CTG GCG AAA GCG GCC GGT CCG GAA GGG ATG GCC GCC GGT CAG 
Glu Arg Leu Ala Lys Ala Ala Gly Pro Glu Gly Met Ala Ala Gly Gin 
145 150 155 160 

GCA GCC GAT ATG GAA GGA GAG GGG AAA ACG CTG ACG CTT TCG GAG CTC 
Ala Ala Asp Met Glu Gly Glu Gly Lys Thr Leu Thr Leu Ser Glu Leu 
165 170 175 

GAA TAC ATT CAT CGG TAT AAA ACC GGG AAA ATG CTG CAA TAC AGC GTG 
Glu Tyr lie His Arg Tyr Lys Thr Gly Lys Met Leu Gin Tyr Ser Val 
180 185 190 

CAC GCC GGC GCC TTG ATC GGC GGC GCT GAT GCC CGG CAA ACG CGG GAG 
His Ala Gly Ala Leu He Gly Gly Ala Asp Ala Arg Gin Thr Arg Glu 
195 200 205 

CTT GAC GAA TTC GCC GCC CAT CTA GGC CTT GCC TTT CAA ATT CGC GAT 
Leu Asp Glu Phe Ala Ala His Leu Gly Leu Ala Phe Gin He Arg Asp 
210 215 220 

GAT ATT CTC GAT ATT GAA GGG GCA GAA GAA AAA ATC GGC AAG CCG GTC 
Asp He Leu Asp He Glu Gly Ala Glu Glu Lys He Gly Lys Pro Val 
225 230 235 240 

GGC AGC GAC CAA AGC AAC AAC AAA GCG ACG TAT CCA GCG TTG CTG TCG 
Gly Ser Asp Gin Ser Asn Asn Lys Ala Thr Tyr Pro Ala Leu Leu Ser 
245 250 255 

CTT GCC GGC GCA AAG GAA AAG TTG GCG TTC CAT ATC GAG GCG GCG CAG 
Leu Ala Gly Ala Lys Glu Lys Leu Ala Phe His He Glu Ala Ala Gin 
260 265 270 

CGC CAT TTA CGG AAC GCC GAC GTT GAC GGC GCC GCG CTC GCC TAT ATT 
Arg His Leu Arg Asn Ala Asp Val Asp Gly Ala Ala Leu Ala Tyr He 
275 280 285 

TGC GAA CTG GTC GCC GCC CGC GAC CAT TAA 
Cys Glu Leu Val Ala Ala Arg Asp His 
290 295 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 297 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Gin Leu Ser Val Glu Gin Phe Leu Asn Glu Gin Lys Gin Ala 
1 5 10 15 

Val Glu Thr Ala Leu Ser Arg Tyr lie Glu Arg Leu Glu Gly Pro Ala 
20 25 30 

Lys Leu Lys Lys Ala Met Ala Tyr Ser Leu Glu Ala Gly Gly Lys Arg 
35 40 45 

lie Arg Pro Leu Leu Leu Leu Ser Thr Val Arg Ala Leu Gly Lys Asp 
50 55 60 

Pro Ala Val Gly Leu Pro Val Ala Cys Ala He Glu Met He His Thr 
65 70 75 80 

Tyr Ser Leu He His Asp Asp Leu Pro Ser Met Asp Asn Asp Asp Leu 
85 90 95 

Arg Arg Gly Lys Pro Thr Asn His Lys Val Phe Gly Glu Ala Met Ala 
100 105 110 

He Leu Ala Gly Asp Gly Leu Leu Thr Tyr Ala Phe Gin Leu He Thr 
115 120 125 

Glu He Asp Asp Glu Arg He Pro Pro Ser Val Arg Leu Arg Leu He 
130 135 140 

Glu Arg Leu Ala Lys Ala Ala Gly Pro Glu Gly Met Ala Ala Gly Gin 
145 150 155 160 

Ala Ala Asp Met Glu Gly Glu Gly Lys Thr Leu Thr Leu Ser Glu Leu 
165 170 175 

Glu Tyr He His Arg Tyr Lys Thr Gly Lys Met Leu Gin Tyr Ser Val 
180 185 190 

His Ala Gly Ala Leu He Gly Gly Ala Asp Ala Arg Gin Thr Arg Glu 
195 200 205 

Leu Asp Glu Phe Ala Ala His Leu Gly Leu Ala Phe Gin He Arg Asp 
210 215 220 

Asp He Leu Asp He Glu Gly Ala Glu Glu Lys He Gly Lys Pro Val 
225 230 235 240 

Gly Ser Asp Gin Ser Asn Asn Lys Ala Thr Tyr Pro Ala Leu Leu Ser 
245 250 255 
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Leu Ala Gly Ala Lys Glu 
260 

^ Arg His Leu Arg Asn Ala 

275 

Cys Glu Leu Val Ala Ala 
290 

10 
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Lys Leu Ala Phe His lie Glu Ala Ala Gin 
265 270 

Asp Val Asp Gly Ala Ala Leu Ala Tyr lie 
280 285 

Arg Asp His 
295 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus stearothermophilus 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..894 

(D) OTHER INFORMATION: /function^ "Mutant (4) of DMA 
coding for f arnesyldiphosphate synthase** 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG GCG CAG CTT TCA GTT GAA CAG TTT CTC AAC GAG CAA AAA CAG GCG 
Met Ala Gin Leu Ser Val Glu Gin Phe Leu Asn Glu Gin Lys Gin Ala 
15 10 15 

GTG GAA ACA GCG CTC TCC CGT TAT ATA GAG CGC TTA GAA GGG CCG GCG 
Val Glu Thr Ala Leu Ser Arg Tyr He Glu Arg Leu Glu Gly Pro Ala 
20 25 30 

AAG CTG AAA AAG GCG ATG GCG TAC TCA TTG GAG GCC GGC GGC AAA CGA 
Lys Leu Lys Lys Ala Met Ala Tyr Ser Leu Glu Ala Gly Gly Lys Arg 
35 40 45 

ATC CGT CCG TTG CTG CTT CTG TCC ACC GTT CGG GCG CTC GGC AAA GAC 
He Arg Pro Leu Leu Leu Leu Ser Thr Val Arg Ala Leu Gly Lys Asp 
50 55 60 

CCG GCG GTC GGA TTG CCC GTC GCC TGC GCG ATT GAA ATG ATC CAT ACG 
Pro Ala Val Gly Leu Pro Val Ala Cys Ala He Glu Met He His Thr 
65 70 75 80 

CAC TCT TTG ATC CAT GAT GAT TTG CCG AGC ATG GAC AAC GAT GAT TTG 
His Ser Leu He His Asp Asp Leu Pro Ser Met Asp Asn Asp Asp Leu 
85 90 95 

CGG CGC GGC AAG CCG ACG AAC CAT AAA GTG TTC GGC GAG GCG ATG GCC 
Arg Arg Gly Lys Pro Thr Asn His Lys Val Phe Gly Glu Ala Met Ala 
100 105 110 

ATC TTG GCG GGG GAC GGG TTG TTG ACG TAC GCG TTT CAA TTG ATC ACC 
He Leu Ala Gly Asp Gly Leu Leu Thr Tyr Ala Phe Gin Leu He Thr 
115 120 125 

GAA ATC GAC GAT GAG CGC ATC CCT CCT TCC GTC CGG CTT CGG CTC ATC 
Glu He Asp Asp Glu Arg He Pro Pro Ser Val Arg Leu Arg Leu He 
130 135 140 
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GAA CGG CTG GCG AAA GCG GCC GGT CCG GAA GGG ATG GTC GCC GGT GAG 480 
Glu Arg Leu Ala Lys Ala Ala Gly Pro Glu Gly Met Val Ala Gly Gin 
145 150 155 160 

GCA GCC GAT ATG GAA GGA GAG GGG AAA ACG CTG ACG CTT TCG GAG CTG 528 
Ala Ala Asp Met Glu Gly Glu Gly Lys Thr Leu Thr Leu Ser Glu Leu 
165 170 175 

GAA TAC ATT CAT CGG CAT AAA ACC GGG AAA ATG CTG CAA TAG AGC GTG 576 
Glu Tyr He His Arg His Lys Thr Gly Lys Met Leu Gin Tyr Ser Val 
180 185 190 

CAC GCC GGC GCC TTG ATC GGC GGC GCT GAT GCC CGG CAA ACG CGG GAG 624 
His Ala Gly Ala Leu He Gly Gly Ala Asp Ala Arg Gin Thr Arg Glu 
195 200 205 

CTT GAC GAA TTC GCC GCC CAT CTA GGC CTT GCC TTT CAA ATT CGC GAT 672 
Leu Asp Glu Phe Ala Ala His Leu Gly Leu Ala Phe Gin He Arg Asp 
210 215 220 

GAT ATT CTC GAT ATT GAA GGG GCA GAA GAA AAA ATC GGC AAG CGG GTC 720 
Asp He Leu Asp He Glu Gly Ala Glu Glu Lys He Gly Lys Arg Val 
225 230 235 240 

GGC AGC GAC CAA AGC AAC AAC AAA GCG ACG TAT CCA GCG TTG CTG TCG 768 
Gly Ser Asp Gin Ser Asn Asn Lys Ala Thr Tyr Pro Ala Leu Leu Ser 
245 250 255 

CTT GCC GGC GCG AAG GAA AAG TTG ACG TTC CAT ATC GAG GCG GCG CAG 816 
Leu Ala Gly Ala Lys Glu Lys Leu Thr Phe His He Glu Ala Ala Gin 
260 265 270 

CGC CAT TTA CGG AAC GCC GAC GTT GAC GGC GCC GCG CTC GCC TAT ATT 864 
Arg His Leu Arg Asn Ala Asp Val Asp Gly Ala Ala Leu Ala Tyr He 
275 280 285 

TGC GAA CTG GTC GCC GCC CGC GAC CAT TAA 894 
Cys Glu Leu Val Ala Ala Arg Asp His 
290 295 



45 



50 
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(2) INFORMATION FOR SBQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ala Gin Leu Ser Val Glu Gin Phe Leu Asn Glu Gin Lys Gin Ala 
15 10 15 

Val Glu Thr Ala Leu Ser Arg Tyr lie Glu Arg Leu Glu Gly Pro Ala 
20 25 30 

Lys Leu Lys Lys Ala Met Ala Tyr Ser Leu Glu Ala Gly Gly Lys Arg 
35 40 45 

lie Arg Pro Leu Leu Leu Leu Ser Thr Val Arg Ala Leu Gly Lys Asp 
50 55 60 

Pro Ala Val Gly Leu Pro Val Ala Cys Ala He Glu Met He His Thr 
65 70 75 80 

His Ser Leu He His Asp Asp Leu Pro Ser Met Asp Asn Asp Asp Leu 
85 90 95 

Arg Arg Gly Lys Pro Thr Asn His Lys Val Phe Gly Glu Ala Met Ala 
100 105 1X0 

He Leu Ala Gly Asp Gly Leu Leu Thr Tyr Ala Phe Gin Leu He Thr 
. 115 120 125 

Glu He Asp Asp Glu Arg He Pro Pro Ser Val Arg Leu Arg Leu He 
130 135 140 

Glu Arg Leu Ala Lys Ala Ala Gly Pro Glu Gly Met Val Ala Gly Gin 
145 150 155 160 

Ala Ala Asp Met Glu Gly Glu Gly Lys Thr Leu Thr Leu Ser Glu Leu 
165 170 175 

Glu Tyr He His Arg His Lys Thr Gly Lys Met Leu Gin Tyr Ser Val 
180 185 190 

His Ala Gly Ala Leu He Gly Gly Ala Asp Ala Arg Gin Thr Arg Glu 
195 200 205 

Leu Asp Glu Phe Ala Ala His Leu Gly Leu Ala Phe Gin He Arg Asp 
210 215 220 

Asp He Leu Asp He Glu Gly Ala Glu Glu Lys He Gly Lys Arg Val 
225 230 235 240 

Gly Ser Asp Gin Ser Asn Asn Lys Ala Thr Tyr Pro Ala Leu Leu Ser 
245 250 255 
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Leu Ala Gly Ala Lys Glu Lys Leu Thr Phe His lie Glu Ala Ala Gin 
260 265 270 

^ Arg His Leu Arg Asn Ala Asp Val Asp Gly Ala Ala Leu Ala Tyr lie 

275 280 285 

Cys Glu Leu Val Ala Ala Arg Asp His 
290 295 
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(2) INFORMATION FOR SEQ ID NO: 9: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS : double 
<D) TOPOLOGY: linear 



(vi) ORIGINAL SOURCE 

10 



(A) ORGANISM: Bacillus stearothermophilus 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..894 

^5 (D) OTHER INFORMATION: /function^ "DNA coding for native 

farnesyldiphosphate synthase" 



20 



25 



40 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG GCG CAG CTT TCA GTT GAA CAG TTT CTC AAC GAG CAA AAA CAG GCG 48 
Met Ala Gin Leu Ser Val Glu Gin Phe Leu Asn Glu Gin Lys Gin Ala 
15 10 15 

GTG GAA ACA GCG CTC TCC CGT TAT ATA GAG CGC TTA GAA GGG CCG GCG 96 
Val Glu Thr Ala Leu Ser Arg Tyr He Glu Arg Leu Glu Gly Pro Ala 
20 25 30 



AAG CTG AAA AAG GCG ATG GCG TAC TCA TTG GAG GCC GGC GGC AAA CGA 144 
Lys Lys Lys Lys Ala Met Ala Tyr Ser Leu Glu Ala Gly Gly Lys Arg 
30 35 40 45 

ATC CGT CCG TTG CTG CTT CTG TCC ACC GTT CAG GCG CTC GGC AAA GAC 192 
He Arg Pro Leu Leu Leu Leu Ser Thr Val Gin Ala Leu Gly Lys Asp 
50 55 60 

3S 

CCG GCG GTC GGA TTG CCC GTC GCC TGC GCG ATT GAA ATG ATC CAT ACG 240 
Pro Ala Val Gly Leu Pro Val Ala Cys Ala He Glu Met He His Thr 
65 70 75 80 

TAC TCT TTG ATC CAT GAT GAT TTG CCG AGC ATG GAC AAC GAT GAT TTG 288 
lyr Ser Leu He His Asp Asp Leu Pro Ser Met Asp Asn Asp Asp Leu 
85 90 95 

CGG CGC GGC AAG CCG ACG AAC CAT AAA GTG TTC GGC GAG GCG ATG GCC , 336 

Arg Arg Gly Lys Pro Thr Asn His Lys Val Phe Gly Glu Ala Met Ala 
100 105 110 

ATC TTG GCG GGG GAC GGG TTG TTG ACG TAC GCG TTT CAA TTG ATC ACC 384 
He Leu Ala Gly Asp Gly Leu Leu Thr Tyr Ala Phe Gin Leu He Thr 
115 120 125 

SO GAA ATC GAC GAT GAG CGC ATC CCT CCT TCC GTC CGG CTT CGG CTC ATC 432 

Glu He Asp Asp Glu Arg He Pro Pro Ser Val Arg Leu Arg Leu He 
130 135 140 
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GAA CGG CTG GCG AAA GCG GCC GGT CCG GAA GGG ATG GTC GCC GGT. GAG 480 
Glu Arg Leu Ala Lys Ala Ala Gly Pro Glu Gly Met Val Ala Gly Gin 
145 150 155 160 

5 

GCA GCC GAT ATG GAA GGA GAG GGG AAA ACG CTG ACG CTT TCG GAG CTC 528 
Ala Ala Asp Met Glu Gly Glu Gly Lys Thr Leu Thr Leu Ser Glu Leu 
165 170 175 

10 GAA TAC ATT CAT CGG CAT AAA ACC GGG AAA ATG CTG CAA TAG AGC GTG 576 

Glu Tyr lie His Arg His Lys Thr Gly Lys Met Leu Gin Tyr Ser Val 
180 185 190 

CAC GCC GGC GCC TTG ATC GGC GGC GCT GAT GCC CGG CAA ACG CGG GAG 624 
75 His Ala Gly Ala Leu lie Gly Gly Ala Asp Ala Arg Gin Thr Arg Glu 

195 200 205 

CTT GAC GAA TTC GCC GCC CAT CTA GGC CTT GCC TTT CAA ATT CGC GAT 672 
Leu Asp Glu Phe Ala Ala His Leu Gly Leu Ala Phe Gin lie Arg Asp 
20 210 215 220 

GAT ATT CTC GAT ATT GAA GGG GCA GAA GAA AAA ATC GGC AAG CCG GTC 720 

Asp lie Leu Asp lie Glu Gly Ala Glu Glu Lys lie Gly Lys Pro Val 
225 230 235 240 

25 

GGC AGC GAC CAA AGC AAC AAC AAA GCG ACG TAT CCA GCG TTG CTG TCG 768 

Gly Ser Asp Gin Ser Asn Asn Lys Ala Thr Tyr Pro Ala Leu Leu Ser 
245 250 255 

30 CTT GCC GGC GCG AAG GAA AAG TTG GCG TTC CAT ATC GAG GCG GCG CAG 816 

Leu Ala Gly Ala Lys Glu Lys Leu Ala Phe His lie Glu Ala Ala Gin 
260 265 270 

CGC CAT TTA CGG AAC GCC GAC GTT GAC GGC GCC GCG CTC GCC TAT ATT 864 
55 Arg His Leu Arg Asn Ala Asp Val Asp Gly Ala Ala Leu Ala Tyr lie 

275 280 285 

TGC GAA CTG GTC GCC GCC CGC GAC CAT TAA 894 
Cys Glu Leu Val Ala Ala Arg Asp His 
40 290 295 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 297 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Gin Leu Ser Val Glu Gin Phe Leu Asn Glu Gin Lys Gin Ala 
15 10 15 

Val Glu Thr Ala Leu Ser Arg Tyr He Glu Arg Leu Glu Gly Pro Ala 
20 25 30 

Lys Lys Lys Lys Ala Met Ala Tyr Ser Leu Glu Ala Gly Gly Lys Arg 
35 40 45 

He Arg Pro Leu Leu Leu Leu Ser Thr Val Gin Ala Leu Gly Lys Asp 
50 55 60 

Pro Ala Val Gly Leu Pro Val Ala Cys Ala lie Glu Met He His Thr 
65 70 75 80 

Tyr Ser Leu He His Asp Asp Leu Pro Ser Met Asp Asn Asp Asp Leu 
85 90 95 

Arg Arg Gly Lys Pro Thr Asn His Lys Val Phe Gly Glu Ala Met Ala 
100 105 110 

He Leu Ala Gly Asp Gly Leu Leu Thr Tyr Ala Phe Gin Leu He Thr 
115 120 125 

Glu He Asp Asp Glu Arg He Pro Pro Ser Val Arg Leu Arg Leu He 
130 135 140 

Glu Arg Leu Ala Lys Ala Ala Gly Pro Glu Gly Met Val Ala Gly Gin 
145 150 155 160 

Ala Ala Asp Met Glu Gly Glu Gly Lys Thr Leu Thr Leu Ser Glu Leu 
165 170 175 

Glu Tyr He His Arg His Lys Thr Gly Lys Met Leu Gin Tyr Ser Val 
180 185 190 

His Ala Gly Ala Leu He Gly Gly Ala Asp Ala Arg Gin Thr Arg Glu 
195 200 205 

Leu Asp Glu Phe Ala Ala His Leu Gly Leu Ala Phe Gin He Arg T^p 
210 215 220 

Asp He Leu Asp He Glu Gly Ala Glu Glu Lys He Gly Lys Pro Val 
225 230 235 240 

Gly Ser Asp Gin Ser Asn Asn Lys Ala Thr Tyr Pro Ala Leu Leu Ser 
245 250 255 
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Leu Ala Gly Ala Lys Glu Lys Leu Ala Phe His lie Glu Ala Ala Gin 
260 265 270 

Arg His Leu Arg Asn Ala Asp Val Asp Gly Ala Ala Leu Ala Tyr lie 
275 280 285 

Cys Glu Leu Val Ala Ala Arg Asp His 
290 295 



Claims 

1 . A process for production of a gene coding for a mutated farnesyldiphosphate synthase capable of synthesizing ger- 
anylgeranyldiphosphate synthase comprising the steps of: 

(1) subjecting a gene coding for a farnesyldiphosphate synthase to mutagenesis; 

(2) expressing the genes subjected to the mutagenesis: and 

(3) selecting a gene coding for a mutated farnesyldiphosphate synthase capable of synthesizing geranylgera- 
nyldiphosphate. 

2. A process according to claim 1 , wherein the gene coding for mutated farnesyldiphosphate synthase capable of syn- 
thesizing geranylgeranyldiphosphate is derived from Bacillus stearothermophilus. 

3. A gene coding for a mutated farnesyldiphosphate synthase capable of synthesizing geranylgeranyldiphosphate 
synthase, obtainable according to a process of claim 1 . 

4. A process for production of a mutated farnesyldiphosphate synthase capable of synthesizing geranylgeranyldi- 
phosphate. comprising the step of expressing a gene of claim 3. 

5. A mutated farnesyldiphosphate synthase capable of synthesizing geranylgeranyldiphosphate. obtainable accord- 
ing to a process of claim 4. 

6. A process for production of geranylgeranyldiphosphate or geranylgeranyol, comprising the step of acting a mutated 
farnesyldiphosphate synthase capable of synthesizing geranylgeranyldiphosphate on a substrate selected from the 
group consisting of isopentenyldiphosphate, dimethylallyldiphosphatep geranyldiphosphate and farnesyldiphos- 
phate. 

7. A geranylgeranyldiphosphate synthase having an amino acid sequence modified from a native amino acid 
sequence of a farnesyldiphosphate synthase wherein the modification comprises deletion of one to a few amino 
acid residues, addition of one to a few amino acid residues or replacement of one to a few amino acid residues with 
other amino acid residues, or a combination of said modification. 

8. A geranylgeranyldiphosphate synthase according to claim 7, wherein the modification is present on at least one of 
the positions 34, 59, 81, 157. 182, 239. 265 and 275 of farnesyldiphosphate synthase of Bacillus stearother- 
moDhrlus origin, or one to a few corresponding positions in an amino acid sequence of a farnesyldiphosphate syn- 
thase of other origin. 

9. A geranylgeranyldiphosphate synthase having an amino acid sequence shown in SEQ ID NO: 1 . 

10. A geranylgeranyldiphosphate synthase having an amino acid sequence shown in SEQ ID NO: 2. 

11. A geranylgeranyldiphosphate synthase having an amino acid sequence shown in SEQ ID NO: 3. 

12. A geranylgeranyldiphosphate synthase having an amino acid sequence shown in SEQ ID NO: 4. 
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13. A gene coding for a geranylgeranyldiphosphate synthase according to claim 7. 

14. A gene coding for a geranylgeranyldiphosphate synthase according to claim 8. 

15. A gene coding for a geranylgeranyldiphosphate synthase according to claim 9. 

16. A gene coding for a geranylgeranyldiphosphate synthase according to claim 10. 

17. A gene coding for a geranylgeranyldiphosphate synthase according to claim 1 1 . 

18. A gene coding for a geranylgeranyldiphosphate synthase according to claim 12. 

19. An expression vector comprising a gene according to daim 13. 

20. An expression vector comprising a gene according to claim 14. 

21. An expression vector comprising a gene according to claim 15. 

22. An expression vector comprising a gene according to daim 16. 

23. An expression vector comprising a gene according to daim 17. 

24. An expression vector comprising a gene according to daim 18. 

25. A recombinant host transformed with an expression vector according to daim 19. 

26. A recombinant host transformed with an expression vector according to daim 20. 

27. A recombinant host transformed with an expression vector according to daim 21 . 

28. A recombinant host transformed with an expression vector according to daim 22. 

29. A recombinant host transformed with an expression vector according to daim 23. 

30. A recombinant host transformed with an expression vector according to daim 24. 
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